2701 TTATTATCCAGAACATCTA GTTAATCATTACTT CCAAACTAGA CACTATTTACAC ACTCT 

HNF1 HNF3 



2761 ATGG AAGGCGGGTA TA TTATATAA GAGAGAAACAACACATAGCGCCTCATTTTGTGGGTC 

Sp1 TBP RNA start



2821 ACCATATTCTTGGGAACAAGATCTACAGCATGGGGC 

PreS1 protein start 








Fig. 2 (plot; graphic not reproduced). x-axis: [nM competitor duplex], ticks 300-700; y-axis: ticks 1500-7500 (axis label illegible). Competitor duplexes:

AGACACTATTTACACACTC/HNF3-wt 
GGTATATTATATAAGAGAG/TBP-wt 
GGAATATGCGCGCCGAGAG/TBP-mut 
CTAGTTAATCATTACTTCC/HNF1-wt 
CTATCGCCGACGGCAGTCC/HNF1-m 
CTAGTTAATAATTAATTCC/HNF1-21X 






1081 CTA AGC AGG CTT TCA CTT TCT CGC CAA CTT ACA AGG CCT TTC TGT GTA AAC AAT 

NF1 (1100-1119) 

2c (1119-1134) 
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CAGCTGGG CCGCCCTTGT 

GCGGGGGCCA ACGCGATTGT GGGTGCTCGG 

CCCTGCTGGG GCAACCCATC GCTCCCCATG 

CCCGGAATAT TAGTAATCCT AATTCCCGGC 

GAAAGGTGGG GGTGGGGGGG GTCGCATCTT 

CTTTTTCTAT CAGTTTTCTT TGAGCTTTTA 

CTGAACTATA TTCAAAAGGA AGTAAATGAA 

AAAATAAAAA CAAAAGTTAA GACAGTAAAA 

ACAGAACCTG TAATTTTAAA AACTGTGTAT 

ATATTGGGGA CCCTCTCATG TAACCACGAA 

GTACACTCGT TTGTTTAATT GATAATTGTT 

CACGCTCACG AATTCAGTCC CAGGGCAAAT 

ACAAAACCAA TTAGGAACTT CGGTGGTCTT 

CAATTTAATT TCTTTTTTAA TTAAAAAAAA 

CTTTCCATTC AGAGGTGTGT TTCTCCCGGT 

CAGTTGGGGA CCCCCGCAAG GACCGACTGG 

CAGGCTAGAA GGACAAGATG AAGGAAATGC 

TTCGGGCATT TATTTTATTT TATTTTTTGA 

ACTTTTAGGG TTACCCCCTT GGGCATTTGC 

TGCACAGGGG TTGTGTGCCC GGTCCTCCCC 

TTACACGTGT TAATGAAAAT GAAAGAAGAT 

CGCCCGTGGG TGCCCTCGTG GCGTTCTTGG 

GGGTGTCGCC GCGCCCCAGT CACCCCTTCT 

GCCTTCCTAG TTGTCCCCTA CTGCAGAGCC 

ACCCACTCGA GGCGGACGGG GCCCCCTGCA 

AGCGGGGCGA TTTGCATTTC TATGAAAACC 

GGCGCGGCGC CTCAGGGATG GCTTTTGGGC 

CCCGCGCCCC CTCCCCCTGC GCCCGCCCCC 

CTTTGATCTT TGCTTAACAA CAGTAACGTC 

TGCAAAGTCC TGGAGCCTCC AGAGGGCTGT 

CACGCTCCGG CGAGGGGCAG AAGAGCGCGA 

AGCGCGGACC CAGCCAGGAC CCACAGCCCT 



GCGCGGGCTG ATGCTCTGAG GCTTGGCTAT 
GGAGTGGGGG GGGGCACGAC CGTAGGTGCT 
CGGAATCCGG GGGTAATTAC CCCCCCAGGA 
GGGGGAGGGG GCGCGGGAGG AATTCACCCT 
GCTGTGAGCA CCCTGGCGAA GGGGAGAGGG 
CTGTTAAGAG GGTACGGTGG TTTGATGACA 
CAGTTTTCTT AATTTGGGGC AGGTACTGTA 
TGTCCTTTTA TTTTTTAATG CACCAAAGAG 
TTTAATTTAC ATCTGCTTAA GTTTGCGATA 
CACCTATCGA TTTTGCTAAA AATCAGATCA 
CTGAATTATG CCGGCTCCTG CCAGCCCCCT 
TCTAAAGGTG AAGGGACGTC TACACCCCCA 
GTCCCAGGCA GAGGGGACTA ATATTTCCAG 
TGAGTCAGAA TGGAGATCAC TGTTTCTCAG 
TAAATTGCCG GCACGGGAAG GGAGGGGGTG 
TCAAGGTAGG AAGGCAGCCC GAAGAGTCTC 
TGGCCACCAT CTTGGGCTGC TGCTGGAATT 
GCGAGCGCAT GCTAAGCTGA AATCCCTTTA 
AACGACGCCC CTGTGCGCCG GAATGAAACT 
GTCCTTGCAT GCTAAATTAG TTCTTGCAAT 
GCAGTCGCTG AGATTCTTTG GCCGTCTGTC 
AAATGCGCCC ATTCTGCCGG CTTGGATATG 
CGTGGTCTCC CCAGGCTGCG TGCTGTGCCG 
ACCTCCACCT CACCCCCTAA ATCCCGGGGG 
CCCCTCTTCC CTGGCGGGGA GAAAGGCTGC 
GGACTACAGG GGCAACTCCG CCGCAGGGCA 
TCTGCCCCTC GCTGCTCCCG GCGTTTGGCG 
GCCCCCCTCC CGCTCCCATT CTCTGCCGGG 
ACACGGACTA CAGGGGAGTT TTGTTGAAGT 
CGGCGCAGTA GCAGCGAGCA GCAGAGTCCG 
GGGAGCGCGG GGCAGCAGAA GCGAGAGCCG 
CCCCAGCTGC CCAGGAAGAG CCCCA 
10 20 30 40 50 60 70 

GAATTCACTG GGGAGAGCAT TCAGGAAGAT GACAACAGGA TAATAGGTCA ACAGAGTAAT AGAGAGGTCG 
CTTAAGTGAC CCCTCTCGTA AGTCCTTCTA CTGTTGTCCT ATTATCCAGT TGTCTCATTA TCTCTCCAGC 

80 90 100 110 120 130 140 

CTAAAAATAA ACTCTAAGAA GTATTCAGCC AAAACTATTA TTGAGCTAAT AATGGTGGGA TCAATTTCAG 
GATTTTTATT TGAGATTCTT CATAAGTCGG TTTTGATAAT AACTCGATTA TTACCACCCT AGTTAAAGTC 

150 160 170 180 190 200 210 

GGGAATATTG TGGGCAGAAG TCAGACTGTA GGAGGCTGGG GATCAAGAAG TTGAGGCAAG GAGGTTGGAC 
CCCTTATAAC ACCCGTCTTC AGTCTGACAT CCTCCGACCC CTAGTTCTTC AACTCCGTTC CTCCAACCTG 

220 230 240 250 260 270 280 

AACAACTGTT TTTTCAAGTT GGTCACGTGA ACAAATCTGT GACCTTCAGC CTCCCCTCCC TCGGGTCTTG 
TTGTTGACAA AAAAGTTCAA CCAGTGCACT TGTTTAGACA CTGGAAGTCG GAGGGGAGGG AGCCCAGAAC 



290 300 310 320 330 340 350 

GCTGAGCTGA TTGCAGGGCC CCTGCAGCTC TGGCACTCTC AAGTTGTATA AAACTGACAG TGCAGAAGTC 
CGACTCGACT AACGTCCCGG GGACGTCGAG ACCGTGAGAG TTCAACATAT TTTGACTGTC ACGTCTTCAG 
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360 370 380 390 400 410 420 

CTTGAGCCCA TTTTGGCTCT CATGATAATT TTCCTTCAGT GGAACTAAGG TTACTTGTCT AAGAACCAAA 
GAACTCGGGT AAAACCGAGA GTACTATTAA AAGGAAGTCA CCTTGATTCC AATGAACAGA TTCTTGGTTT 



430 440 450 460 470 480 490 

GCCTCTGACT TGACTGATCA AAGTTCATCA CGTGCATCGA AGCCACCTAC TTGGCAGATG TAGTGAAAAG 
CGGAGACTGA ACTGACTAGT TTCAAGTAGT GCACGTAGCT TCGGTGGATG AACCGTCTAC ATCACTTTTC 

500 510 520 530 540 550 560 

CTACATAGAT CTGGGCCCAG GACAGGATGC TGGGGCGTGG GAGGGGAAGA AAGCAGGTGC TAACTATATA 
GATGTATCTA GACCCGGGTC CTGTCCTACG ACCCCGCACC CTCCCCTTCT TTCGTCCACG ATTGATATAT 



gttta 



C 3 



570 580 590 600 610 620 630 

GATAGCATGC CTATCAGAGC AGTTTTTACG TTTCCTATTT GTCTCTCAAA ACAATTTTAT AGGAATCATC 
CTATCGTACG GATAGTCTCG TCAAAAATGC AAAGGATAAA CAGAGAGTTT TGTTAAAATA TCCTTAGTAG 



640 650 660 670 680 690 700 

AAAGCAATTT TATCATGGTT TCTAGACCAG GTTTGGATGT GAGGTAGGGA TTTCCACAGC TGCTTTTAGT 
TTTCGTTAAA ATAGTACCAA AGATCTGGTC CAAACCTACA CTCCATCCCT AAAGGTGTCG ACGAAAATCA 

710 720 730 740 750 760 770 

TTGAAGGAAA TCTGATAAGA TGATGCAAAA GCCCTTCAGA AATGTGTAAT CCTACACACT TCAGTGATTC 
AACTTCCTTT AGACTATTCT ACTACGTTTT CGGGAAGTCT TTACACATTA GGATGTGTGA AGTCACTAAG 

780 790 800 810 820 830 840 

AATTCATTGT CAAAACTTAA GGTGTTTTTA ATATTGTTAT TGTTCATTTG GTTTTTACCA ACATGTAAGG 
TTAAGTAACA GTTTTGAATT CCACAAAAAT TATAACAATA ACAAGTAAAC CAAAAATGGT TGTACATTCC 

850 860 870 880 890 900 910 

AGTTGGCAAT TATTTGTTAA ACTCATGTCT TAGGCTAAAT AAATTCCAAA AAATTCAGGA TGAGAATTGT 
TCAACCGTTA ATAAACAATT TGAGTACAGA ATCCGATTTA TTTAAGGTTT TTTAAGTCCT ACTCTTAACA 
920 930 940 950 960 970 980 

TTATTGCTTA ACGTGTGTCA AATTTCTTCC ATGCACATCT TTATTAGATC TTCACAGCAA CCTACAGGAT 
AATAACGAAT TGCACACAGT TTAAAGAAGG TACGTGTAGA AATAATCTAG AAGTGTCGTT GGATGTCCTA 

990 1000 1010 1020 1030 1040 1050 

AAGCAAGACA GGTGCAAGTG CCTCCTTTGG GTATGAGGAA ACTGAGGTCT AAAGAGATGA AGTGATTTGC 
TTCGTTCTGT CCACGTTCAC GGAGGAAACC CATACTCCTT TGACTCCAGA TTTCTCTACT TCACTAAACG 

1060 1070 1080 1090 1100 1110 1120 

CCAAGGCTCA TAGCAATTTA TTGGTAGAGC AAAGACTAGA ATTCTCTTAA CTGCAGCCTA TTTTCCCTAT 
GGTTCCGAGT ATCGTTAAAT AACCATCTCG TTTCTGATCT TAAGAGAATT GACGTCGGAT AAAAGGGATA 

1130 1140 1150 1160 1170 1180 1190 

TCTGAACTGT TACATCAGCA TCAACAATTA TCTAATGGAT TGGAACAGTG TACACAGGCA GCTTAGCTAC 
AGACTTGACA ATGTAGTCGT AGTTGTTAAT AGATTACCTA ACCTTGTCAC ATGTGTCCGT CGAATCGATG 

1200 1210 1220 1230 1240 1250 1260 

GTCAAGTCAC GATTTTTACT TTAACTTCAA TTCCAGAGTC TTGGCCTGAT TTCCCTCAAG ACCCTACTTA 
CAGTTCAGTG CTAAAAATGA AATTGAAGTT AAGGTCTCAG AACCGGACTA AAGGGAGTTC TGGGATGAAT 
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1270 1280 1290 1300 1310 1320 1330 

TCTTTGGCTT TGGAAAATTT ATTTTTCTTG CATTATCTTT CCAGCTAAAT TTTATTTAAT AACCATCAGC 
AGAAACCGAA ACCTTTTAAA TAAAAAGAAC GTAATAGAAA GGTCGATTTA AAATAAATTA TTGGTAGTCG 



1340 1350 1360 1370 1380 1390 1400 

ATGCTTTTTT TGCTTTATGC CATGTAGACT TGACCTGAAA ACCTGCCAGG CTTTCATTGA GTTTAGTGAT 
TACGAAAAAA ACGAAATACG GTACATCTGA ACTGGACTTT TGGACGGTCC GAAAGTAACT CAAATCACTA 



1410 1420 1430 1440 1450 1460 1470 

TAAAGAAGTA AAGTTCTGAG AAGCAATTAG TTGATGGGAC ACCAGTCATA AAATCAATCC AAACTTTTGT 
ATTTCTTCAT TTCAAGACTC TTCGTTAATC AACTACCCTG TGGTCAGTAT TTTAGTTAGG TTTGAAAACA 



1480 1490 1500 1510 1520 1530 1540 

TGACATGTGT TTCTTTCTCC ATATACCAGG TTCCCGCTTC GTATTAGTAA GATTGAAATT GAAATAAGTC 
ACTGTACACA AAGAAAGAGG TATATGGTCC AAGGGCGAAG CATAATCATT CTAACTTTAA CTTTATTCAG 



1550 1560 1570 1580 1590 1600 1610 

TATTGCTGGT GGATGAATTT GTCACTTTCC TTGAAACTGG TGAACCCAAA AAGTTAGACA GTGATAGGAA 

ATAACGACCA CCTACTTAAA CAGTGAAAGG AACTTTGACC ACTTGGGTTT TTCAATCTGT CACTATCCTT 

1620 1630 1640 1650 1660 1670 1680 

AATACTGCCA TTGTCTGTTA AGAAGTCTAT GACATTTCAA GGCAAGAATG AATATATGGA AGAAGAAACT 

TTATGACGGT AACAGACAAT TCTTCAGATA CTGTAAAGTT CCGTTCTTAC TTATATACCT TCTTCTTTGA 

1690 1700 1710 1720 1730 1740 1750 

TGTTTCTTCT TTACTTACAA AAAGGAAAGC CTGGAAGTGA ATGATATGGG TATAATTAAA AAAAAAAAAA 

ACAAAGAAGA AATGAATGTT TTTCCTTTCG GACCTTCACT TACTATACCC ATATTAATTT TTTTTTTTTT 



1760 1770 1780 1790 1800 1810 1820 

AAAACAAAAA ACCTTTACGT AACGTTTTGC TGGGAGAGAA GACTACGAAG CACATTTTCC AGGAAGTGTG 
TTTTGTTTTT TGGAAATGCA TTGCAAAACG ACCCTCTCTT CTGATGCTTC GTGTAAAAGG TCCTTCACAC 

1830 1840 1850 1860 1870 1880 1890 

GGCTGCAACG ATTGTGCGCT CTTAACTAAT CCTGAGTAAG GTGGCCACTT TGACAGTCTT CTCATGCTGC 
CCGACGTTGC TAACACGCGA GAATTGATTA GGACTCATTC CACCGGTGAA ACTGTCAGAA GAGTACGACG 

1900 1910 1920 1930 1940 1950 1960 

CTCTGCCACC TTCTCTGCCA GAAGATACCA TTTCAACTTT AACACAGCAT GATCGAAACA TACAACCAAA 
GAGACGGTGG AAGAGACGGT CTTCTATGGT AAAGTTGAAA TTGTGTCGTA CTAGCTTTGT ATGTTGGTTT 

1970 1980 1990 2000 2010 2020 2030 

CTTCTCCCCG ATCTGCGGCC ACTGGACTGC CCATCAGCAT GAAAATTTTT ATGTATTTAC TTACTGTTTT 
GAAGAGGGGC TAGACGCCGG TGACCTGACG GGTAGTCGTA CTTTTAAAAA TACATAAATG AATGACAAAA 

2040 2050 2060 2070 2080 2090 2100 

TCTTATCACC CAGATGATTG GGTCAGCACT TTTTGCTGTG TATCTTCATA GAAGGCTGGA CAAGGTAAGA 
AGAATAGTGG GTCTACTAAC CCAGTCGTGA AAAACGACAC ATAGAAGTAT CTTCCGACCT GTTCCATTCT 

2110 2120 2130 2140 2150 2160 2170 

TGAACCACAA GCCTTTATTA ACTAAATTTG GGGTCCTTAC TAATTCATAG GTTGGTTCTA CCCAAATGAT 
ACTTGGTGTT CGGAAATAAT TGATTTAAAC CCCAGGAATG ATTAAGTATC CAACCAAGAT GGGTTTACTA 

2180 2190 2200 2210 2220 2230 2240 

GGATGATGGT AGAAACCAAA TAGAAGAATG GTCTTGTGGC ATAATGTTTG TTCCCTAGTC AATGAACTCT 
CCTACTACCA TCTTTGGTTT ATCTTCTTAC CAGAACACCG TATTACAAAC AAGGGATCAG TTACTTGAGA 

2250 2260 2270 2280 2290 2300 2310 

CATATTCTTG TCTCTGGTTA GGATCTTGGG ATCTGGAGTC AGACTGCCTG GGCTCAAATC TTGGCTCTGC 
GTATAAGAAC AGAGACCAAT CCTAGAACCC TAGACCTCAG TCTGACGGAC CCGAGTTTAG AACCGAGACG 

2320 2330 2340 2350 2360 2370 2380 

CCATACCATC TCTGTTATCC TGGGGCAAGT GCCTCAGTTT CCACATCTGA GAAATGGGGA TGGTAGTGGT 
GGTATGGTAG AGACAATAGG ACCCCGTTCA CGGAGTCAAA GGTGTAGACT CTTTACCCCT ACCATCACCA 

2390 

GTCCATTTCA TAGAT 
CAGGTAAAGT ATCTA 
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GAGATGTATATAATTTTTTAGGAAAATCTCAAGGTTATCTTTACTTTTTCTTA 
GGAAATTAACAATTTAATATT7VAGAAACGGCTCGTTCTTACACGGTAGACTTA 
ATACCGTAAGAACGAGCCGTTTTCGTTCTTCAGAGAAAGATTTGACAAGATTA 
^3 CCATTGGCATCCCCGTTTTATTTGGTGCCTTTCACAG7\AAGGGTTGGTCTTAA 
M TT 
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TCTAGAAAAT AATTCCCAAT ATTGAATCCC AAAGAATTCA ACATTTGGGC TGTCGTTTGA 61 

AAGATAAGTT GAATTTGGTC ATGAAGGAAG AGAGGGGGGA TACAATTTCA GTAAAAGGTA 121 

ACAGCAAGGT CCAAAGACAG TCAGGTCTTC AGTAGTATGG AGTATATTCA GAGGGAGCCA 181 

AGATGTCTGA TGTGAACTAA AAAGATTGGT GGTTGGTAGG AGGAAGAGGT GTGAGAAGAG 241 

GCTGTAAAGA AAAATTGAAA CTTGATTGTG ATGGACTTTA AAGGCTAGGC TATGGGACTT 301 

GGACATGAAT CTGCAGGCCA GTGTTTGCAG ACTGGCGCCC ATAACTGTCT ATCACAGCAA 361 

CACAGACATG TGTTGTTTGG CCTGCAGAGG TTTGGCCTGC ATGATGATTT TAAACCATCT 421 

GAATTAGTAG CCATCATTTT CAAAAATCAA GAGATGCCAC ATTAAAATAT GGAATGCTGC 481 

TGTTCTTGAA AATAATGAAA CATCTGGAAC ATTGAGGCCA CATTCCTGAC TGACAGCAAT 541 

CAGTTGGAGC TGCGTAGTGA CTGCCCACTT TACATGGGGC ATCTGATCCC TAGTCGATTA 601 

CAGCTGCCAC CACTTCCCTT TATCTCTCTA ATACCAAGCT CTTTTCACTC ATTTTTGTTA 661 

CTTAAGAGAT ATTTGGGTTT GAAACCTCTG ATGCAGGTAA TTGAGGGTTA TAGAGCAGAG 721 

GACAGATGCT ATCAGAGTTG TCTTTTAAGA AAGAACCCTC TGTTCTTCAT TTTGTTGAAG 781 

ATAGCCTGGA AGAGGGCAGC CAGGGGAGAA GTTAGGGCTG GAGCTATGAG AAAGCATAAG 841 

ATGAGATGAT GGCTTCAACA TTGAGGACAG AAAGAATATT GAGATGAGAA AGTAGTCCAT 901 

ATAAGCATCT ATGCAAAGGA AATAGCAGAT GTCCTCAAAT CAGCAGAGGC AACAACTCTG 961 

AAAGTTTATT CATAAGCCCC TCTTTTCATC TCCAATCCAG TTCAAATGTA ATTATTTAAA 1021 

TTGTTCTTCA CTCTCCTTCC TGGATCATGA ATGAGCTCCT TAAATGCAGG GTCCACAGTG 1081 

TCCTATTCAT CAGTGAATTC CAAGTGCCTA GCACAGAGCC TGGCAAATAG TAAATGCTTA 1141 

ACAAATATTC GTTCAGTGCA TGAATTGGAG TGATTCTCTA CTTTGCCTCA TAAGTTGAAA 1201 

AAAGGTTTAT TACATACCTA AATATGCTGA AATCACAGGG CATTTGGCAA CCCCCCAAAA 1261 

CCAAAACTCC CAGTTTGGAA ACAGAATTTT AATTCTGTGA AAATAAAATC CATTCATTTA 1321 

TTCAAAAAAT ATTTATTAAA CAATGACCAT GTCCACACCA GGCTGAGTCC TAAGGATTCA 1381 

ATGATGAACA AAAACCAACA TGATTCCTGC TCTTAGGAAA CATACAGTTC AGTGAGGAAA 1441 

ACAGATTGTG AGAAGTCCTC CAACAAATAC TGGGTGCTAT TAAAATATAT TAAAAGGTGA 1501 

GTGGGTGAGG GACTTGAGCT AGCCTAGGTG GTTCAGGAAG TCTTCCTGGA TGTGCTGATA 1561 

TGCATAGGCA TTAACTAGAT AAATAGAGAG AAGGATGAAC CAACATTGCA GGTAGAGGGA 1621 

ACAGAATATG CAAAGGCAGG AAGGATTATG GAGTCGTTGG AGGACCTGAA TAAAGGCCCA 1681 

GTGTAAGTGG ATCTCAGAAA ACAGGAGGAA AGGTGTATGA GATGAGATCA GAGAGGCAGA 1741 

TCATGTGGGG TATGGTTAAT GTTTTGGACT TTTCTATTAA GAGCAATGGG GAGACAGTGA 1801 

CAGGACTTAA ACGGGGAAAT AATATGACCA GATTAAACTT TCTAAAAAAC CCTCTATGCA 1861 

AATATATATT GAGAGTTAAT TATTGACAAA GATTCAAAGG CAACAAAGTG GAGAGAGAAT 1921 

AGTATTTTCA AAAAATGGTG CCAAAACAAT AGGACATCTA TATTAAAAGT TGGGTATCTG 1981 

TCTACAAAAC TTAATTCAAA ATGGATCACA GACCTAAATG TAAAACTGAA AGCTATACAA 2041 

CTTCTGGAAG GAAAACACAG ATGGGAATCT GTGTGATCTT GAGTTTGAAA ATGATTTATT 2101 

ATATCTGACA CCATAATCCG TAAGTTAACA TAATTCATAA GTGAACAAAG TGATGAACTG 2161 

GACTTCATCA GAATTTAAAA TGTTTGTGCT TCAAAAGACA CTGGTATGAT AATGAAGACA 2221 

AACTACAGAT AAGATATTGT TGAATCATAT TTCTGATAAA GGAATTGTGG CTCAGAATAC 2281 

ATAACTCTAA ACCCCCATAA TAAATTACAA GTAGCCCAAT TAAAAAAAAA AAAAGAGAAA 2341 

AAATTTACAG TCTTCATCAA AGAAAGTATC AATTGTAAAA TAAGCACATG AAAAATGCTC 2401 

TGCATCTTTA TTCATGGGGG GATGAAATAA AAATTAAATG GGAAAGACAC CTCTAATTAG 2461 

AATACTAAAA TTAAAAAGAC TGACCATACC AAGTATTGGT GAAGTGGAAA TGTAAAATGA 2521 

TACAATCAAC TTAGGTAGAT GATTTGGAAG TTTCTTACAA AAGTAGGTGT ATACCTACCC 2581 

TGTGACTCAC CCATTCCATG GCTAAGTATT TACCTGAGAG AAATGAAAGA ATACATCCAT 2641 

ACAAAGATGT TTATACAAAT ATTTATAGCA GTTTTATTTG TAGTAGCCCC AAACTGAAAA 2701 

GAACCCAAAT GTCCATCAAA AGTGAATGGA TAAACAAAGC GTGGTACAGC AATGCAATAG 2761 

AATACTACTT AGCAATAAAG AAGAATGAGC TAGTGATATA CATAACAGCT TAAATGTACA 2821 

TCAAAGGCAT TGTGCTCAGT GAAAGATGCA AGTAAAAAAA AAAAAGAGTA CATGCTGTAT 2881 

AGTTCCATTG ACATAAAACT CTGGAAAGTG AAAAACAGTC TATACTGACA GAAAGCAGAT 2941 

CATTGGTTGC CTGAGGAGGA GGAGTATAGG AGAGGTGGAG GGAAAATGTA CAAAGTGGCA 3001 

CAATAAAAAC TTTTGGAATC ATAGATATAT TCACTATCTT GATTGAGTGA TGATTTCATG 3061 
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AGTGCACGTG CGTGTGTCAA AAATGATCAA TTTATGCAAC TTTAAATATG TGCAGTTTAT 3121 

TGTATATATC AATTATACCT CAGTACGGCT ATTAAAAAGA AACCCTCTGG CTGCACAATG 3181 

CAGAACTGAT TCTAGGAAAG AGTGGAGGGA GGATGACCAT TTACAGTGCT CCAGGTGGAA 3241 

GAGAACGGTG CCTTCTGGAA GTGAACTAGG TTGGCAACAA CAGAGATGAA ATAAATGGGC 3301 

AGATGTGTGA GATACTTAGG AAATAAAACC CGATGGTCAC CATTTTCCAA AGGTCAGCTC 3361 

ATCCTGGCTT TCCAGAGCAA AGAGCTAGGG AAGACTTTAT TAATAAATCC CTCTTGAAGT 3421 

TGCAGAGGAA GCTTATAGCA GAAACTTACT CTCAACCTGA CTAATCTGAG AGAACACCTC 3481 

TGGTTCCATT TGATTACTAA AAAACTGCAA AGAACAGGAG GAGAAAGAAG AAGAAAGCTG 3541 

GTACAAACAG TGAACTTATA TAATATTAAT CAATAATTGT CTCTTGTTCT TAAAAGCAAT 3601 

GGGAAGAAAA TGAGATTTGA GCTGGAAGAT CAGAGTTCAA AATCCAAATA AAGTATATGG 3661 

CCCTAATATG CTTATAGTAG TTAACCTTTC CTGATAATGA TATAATTGTT GACAGCACCA 3721 

TCTTTAAAAT AAAATAACAT AGTAATCCTT CAGATTTGTA GAAGATCTTT CCTGTTTACA 3781 

AGTTTGTTCT ATACACATTA TGTCTTTTAA ATGACACACT AGCCTTCTGA GGGTAACTTA 3841 

TATTGGCAAC AGTTTTCAGA TGTGGAAACT GTGAAGACAA TGTTGGTGAT GTGGAAGCAA 3901 

CATAAACTTT GGAGTCTTTC AGACCCAGGT TTGAATGTCA GACTGCTTTT TATTCAGAGT 3961 

AACTTCAGAG CATTATTTCT CACCTTAATT TTTTTTCAGG CCTCTTTGTG TCTATGTGTC 4021 

CTCTTCACTC CTGTCCATTG TTTCTTCAGT GATTTTTGCC ACCTTCCTTC ACTGTTAGTG 4081 

TGTAGACACA TAGTTCTCCT GGCTCTGAGA GCCTATGTTA ATTCCATTCT ACCATCCTGC 4141 

CACGGCCCAC TCAATTCCTA TTGAGCAATG CTAGTTGAAA GTTGTGGTGG GATTAAATGT 4201 

TGCAATGAGT ATTCAAATGA GGTTGAAGTA TCTACGCATT CTACTTACAT ATGGTGAGGT 4261 

ATATTCAAGG AAGCTGTAGC CATTAAAATC TCAGGAAATA ATTTTTCACC TCCTCAGGTG 4321 

AAAGGGTCTT CAGGCCTTTG TGTTCTGGAA GGTTCATTTA TAGCCATTTC CCAAATGACA 4381 

ATGCGATTGA TGAGTCTAGA GTCTAGCTCA AATAGCAATG GACTGGAAGA CTAGTTTAGG 4441 

TTTTACTAAT GTGGAACATA GAACAAATTA TGTCCTTGTT TCAGCCTGTT CATCTGTGAA 4501 

ATAGAGCCTA TCATATCCAG TCTTCCTTGC CTTTAGGTTT GAGTTACCTT CTTTGGTCAA 4561 

GGTAAGTAAA TGCCTATGAT GTTTGGCTGT GCACAAGATA AAGCTACAAC AAAGCTACAA 4621 

CCCATCTTTT CTCTGTAGAA GACTCAAAAA GCAAAAGAGA CCCAGGAAAA TCTCGGAATG 4681 

ACTTTTGGAA CAGAGAGCCT CCCCAGAATC AGAAGTCAAG GAATTTAAAC ATAGGGAAGG 4741 
CCCAGGTCTC TACTGACATA AAGGAAAGAT GTTTTCTTAT AGGTTTCACG TTTACATTTT 4801 

CTCTCTCTTG ATCCCATTCC CACTTGCATC TGCCACCTTT ACACAGGGCT TATGGGACCT 4861 

CCTCCACAAA AGAGCAGTTG CAGTAACCCA CATCATCCTC TACGCCCTGG CTGTCCATCA 4921 

AGAGGCGAAA AGCAGCCCTA TATAGGTTCT ATCCTTGGAT AGTTCCAGTT GTAAAGTTTA 4981 

AAATATGCGA AGGCAACTTG GAAAAGCAAG CGGCTGCATA CAAAGCAAAC GTTTACAGAG 5041 

CTCTGGACAA AATTGAGCGC CTATGTGTAC ATGGCAAGTG TTTTTAGTGT TTGTGTGTTT 5101 

ACCTGCTTGT CTGGGTGATT TTGCCTTTGA GAGTCTGGAG AGTAGAAGTA CTGGTTAAAG 5161 

GAACTTCCAG ACAGGAAGAA GGCAGAGAAG AGGGTAGAAA TGACTCTGAT TCTTGGGGCT 5221 

GAGGGTTCCT AGAGCAAATG GCACAATGCC ACGAGGCCCG ATCTATCCCT ATGACGGAAT 5281 

CTAAGGTTTC AGCAAGTATC TGCTGGCTTG GTCATGGCTT GCTCCTCAGT TTGTAGGAGA 5341 

CTCTCCCACT CTCCCATCTG CGCGCTCTTA TCAGTCCTGA AAAGAACCCC TGGCAGCCAG 5401 

GAGCAGGTAT TCCTATCGTC CTTTTCCTCC CTCCCTCGCC CCACCCTGTT GGTTTTTTAG 5461 

ATTGGGCTTT GGAACCAAAT TTCCTGAGTG CTGGCCTCCA GGAAATCTGG AGCCCTGGCG 5521 

CCTAAACCTT GGTTTAGGAA ACCAGGAGCT ATTCAGGAAG CAGGGGTCCT CCAGGGCTAG 5581 

AGCTAGCCTC TCCTGCCCTC GCCCACGCTG CGCCAGCACT TGTTTCTCCA AAGCCACTAG 5641 

GCAGGCGTTA GCGCGCGGTG AGGGGAGGGG AGAAAAGGAA AGGGGAGGGG AGGGAAAAGG 5701 

AGGTGGGAAG GCAAGGAGGC CGGCCCGGTG GGGGCGGGAC CCGACTCGCA AACTGTTGCA 5761 

TTTGCTCTCC ACCTCCCAGC GCCCCCTCCG AGATCCCGGG GAGCCAGCTT GCTGGGAGAG 5821 

CGGGACGGTC CGGAGCAAGC CCACAGGCAG AGGAGGCGAC AGAGGGAAAA AGGGCCGAGC 5881 

TAGCCGCTCC AGTGCTGTAC AGGAGCCGAA GGGACGCACC ACGCCAGCCC CAGCCCGGCT 5941 

CCAGCGACAG CCAACGCCTC TTGCAGCGCG GCGGCTTCGA AGCCGCCGCC CGGAGCTGCC 6001 

CTTTCCTCTT CGGTGAAGTT TTTAAAAGCT GCTAAAGACT CGGAGGAAGC AAGGAAAGTG 6061 
CCTGGTAGGA CTGACGGCTG CCTTTGTCCT CCTCCTCTCC ACCCCGCCTC CCCCCACCCT 6121 

GCCTTCCCCC CCTCCCCCGT CTTCTCTCCC GCAGCTGCCT CAGTCGGCTA CTCTCAGCCA 6181 

ACCCCCCTCA CCACCCTTCT CCCCACCCGC CCCCCCGCCC CCGTCGCCCA GCGCTGCCAG 6241 

CCCGAGTTTG CAGAGAGGTA ACTCCCTTTG GCTGCGAGCG GGCGAGCTAG CTGCACATTG 6301 

CAAAGAAGGC TCTTAGGAGC CAGGCGACTG GGGAGCGGCT TCAGCACTGC AGCCACGACC 6361 

CGCCTGGTTA GGCTGCACGC GGAGAGAACC CTCTGTTTTC CCCCACTCTC TCTCCACCTC 6421 

CTCCTGCCTT CCCCACCCCG AGTGCGGAGC CAGAGATCAA AAGATGAAAA GGCAGTCAGG 6481 

TCTTCAGTAG CCAAAAAACA AAACAAACAA AAACAAAAAA CAAGAAATAA AAGAAAAAGA 6541 

TAATAACTCA GTTCTTATTT GCACCTACTT CAGTGGACAC TGAATTTGGA AGGTGGAGGA 6601 

TTTTGTTTTT TTCTTTTAAG ATCTGGGCAT CTTTTGAATC TACCCTTCAA GTATTAAGAG 6661 

ACAGACTGTG AGCCTAGCAG GGCAGATCTT GTCCACCGTG TGTCTTCTTC TGCACGAGAC 6721 

TTTGAGGCTG TCAGAGCGCT TTTTGCGTGG TTGCTCCCGC AAGTTTCCTT CTCTGGAGCT 6781 

TCCCGCAGGT GGGCAGCTAG CTGCAGCGAC TACCGCATCA TCACAGCCTG TTGAACTCTT 6841 

CTGAGCAAGA GAAGGGGAGG CGGGGTAAGG GAAGTAGGTG GAAGATTCAG CCAAGCTCAA 6901 

GGATG 
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CA GGCCCCACAA AACCTAGATC TGCCCCAGTA TAACTAAATC 1501 
TGGGACCATT TATTGAGCAA TTATTATGTG CCAAGTATTG CGCTGAGTGC TTCCAGAGCA 1561 
TTATCTCCTT TAACCCCAGC ATAGTATGTC AGATGCTGTT TTACAGATGA GCCAACTGAG 1621 

ACCAGAGATG CTCAGTCACT TGCCCAAGGT GACATGACTG ATATGGAATA GAGTCAAGAT 1681 
TTTTTTTTTT TTTTTTGACA CGGAGTCTCA CTCTGTCTCC CAGGCTGGAG TGCAGAGGCG 1741 

CAATCTCAGC TCACTGCAAG CTCTGCCTCC CAGGTTCACG CATTCTCCTG CCTCAGCCTC 1801 

CTGAGTAGCT GGGACTACAG GCACCCGCCA CCACACCTGG CTAATTTTTT GTATTTTTAG 1861 

CAGAGACAGG GTTTCACCGT GTTAGCCAGG ATGGTCTCGA TCTCCTGACC TCGTGATCTG 1921 

CCTGCCTCGG CCTCCCAAAG TGATGGAATT ACAGGTGTGA GCCACCGCGA CTGGCCAGAT 1981 

TCAAGATTTG AACCCAGGTC CTCTTGGTCC CAGAGGCCCC TGTTTCTCAA CTCCCTAGCA 2041 

TGCATACGCA CCTGTCCCTC TAGAGGTGCC TGCTTAAGTG TGCTCAGCAC ATGGAAGCAA 2101 

GTTAGAAATG CTAGGTATAC CTGTAAAGAG GTGTGGGAGA TGGGGGGGAG GGAAGAGAGA 2161 

AAGAGATGCT GGTGTCCTTC ATTCTCCAGT CCCTGATAGG TGCCTTTGAT CCCTTCTTGA 2221 

CCAGTATAGC TGCATTCTTG GCTGGGGCAT TCCAACTAGA ACTGCCAAAT TTAGCACATA 2281 

AAAATAAGGA GGCCCAGTTA AATTTGAATT TCAGATAAAC AATGAATAAT TTGTTAGTAT 2341 

AAATATGTCC CATGCAATAT CTTGTTGAAA TTAAAAAAAA AAAAAAAAGT CTTCCTTCCA 2401 

TCCCCACCCC TACCACTAGG CCTAAGGAAT AGGGTCAGGG GCTCCAAATA GAATGTGGTT 2461 

GAGAAGTGGA ATTAAGCAGG CTAATAGAAG GCAAGGGGCA AAGAAGAAAC CTTGAATGCA 2521 

TTGGGTGCTG GGTGCCTCCT TAAATAAGCA AGAAGGGTGC ATTTTGAAGA ATTGAGATAG 2581 

AAGTCTTTTT GGGCTGGGTG CAGTTGCTCG TGGTTGTAAT TCCAGCACTT TGGGAGGCTG 2641 

AGGCGGGAGG ATCACCTGAG CTTGGGAGTT CAAGACCAGC CTCACCAACG TGGAGAAACC 2701 

CTGTCTTTAC TAAAAATACA AAAAATTCAG CTGGTCATGG TGGCACATGC CTGTAATCCC 2761 

AGCTGCTCGG GAGGCTGAGG CAGGAGAATC ACTTGAACCA GGGAGGCAGA GGTTGTGGTG 2821 

AGCAGAGATC GCGCCATTGC TCTCCAGCCT GGGCAACAAG AGCAAAAGTT CGTTTAAAAA 2881 

AAAAAAAAAG TCCTTTCGAT GTGACTGTCT CCTCCCAAAT TTGTAGACCC TCTTAAGATC 2941 

ATGCTTTTCA GATACTTCAA AGATTCCAGA AGATATGCCC CGGGGGTCCT GGAAGCCACA 3001 

AGGTAAACAC AACACATCCC CCTCCTTGAC TATCAATTTT ACTAGAGGAT GTGGTGGGAA 3061 

AACCATTATT TGATATTAAA ACAATAGGCT TGGGATGGAG TAGGATGCAA GCTCCCCAGG 3121 

AAGTTAGATA ACTGAGACTT AAAGGGTGTT AAGAGTGGCA GCCTAGGGAA ATTTATCCCG 3181 

GACTCCGGGG GAGGGGGCAG AGTCACCAGC CTCTGCATTT AGGGATTCTC CGAGGAAAAG 3241 

TGTGAGAACG GCTGCAGGCA ACCCAGGCGT CCCGGCGCTA GGAGGGACGA CCCAGGCCTG 3301 

CGCGAAGAGA GGGAGAAAGT GAAGCTGGGA GTTGCCGACT CCCAGACTTC GTTGGAATGC 3361 

AGTTGGAGGG GGCGAGCTGG GAGCGCGCTT GCTCCCAATC ACCGGAGAAG GAGGAGGTGG 3421 

AGGAGGAGGG CTGCTTGAGG AAGTATAAGA ATGAAGTTGT GAAGCTGAGA TTCCCCTCCA 3481 

TTGGGACCGG AGAAACCAGG GGAGCCCCCC GGGCAGCCGC GCGCCCCTTC CCACGGGGCC 3541 

CTTTACTGCG CCGCGCGCCC GGCCCCCACC CCTCGCAGCA CCCCGCGCCC CGCGCCCTCC 3601 
CAGCCGGGTC CAGCCGGAGC CATGG 
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